Influence analysis of Github repositories

Authors

  • Yan Hu
  • Jun Zhang
  • Xiaomei Bai
  • Shuo Yu
  • Zhuo Yang
Abstract

With the support of cloud computing techniques, social coding platforms have changed the style of software development. GitHub is now the most popular social coding platform and project hosting service. Software developers of all levels keep joining GitHub and use it to host their public and private software projects. The large numbers of developers and repositories on GitHub pose new challenges to the world of software engineering. This paper tackles one important problem: analyzing the importance and influence of GitHub repositories. We propose a HITS-based influence analysis on graphs that represent the star relationship between GitHub users and repositories. A weighted version of HITS is applied to the overall star graph and generates a set of top influential repositories that differs from the results of the standard HITS algorithm. We also conduct the influence analysis on per-month star graphs and study the monthly influence ranking of top repositories.
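
As an illustration of the approach described in the abstract, the following Python sketch runs HITS on a small user-repository star graph. It is not the authors' implementation: the function name, the toy users and repositories, and the edge weights are hypothetical, and the abstract does not say how the weighted variant assigns its weights. Users act as hubs and repositories as authorities; setting every weight to 1.0 gives standard HITS.

```python
from collections import defaultdict
from math import sqrt


def hits_star_graph(stars, iterations=50):
    """Run HITS on a bipartite star graph.

    stars: iterable of (user, repo, weight) edges, one per star.
    Returns (hub, authority) dicts with L2-normalized scores.
    The weight column is an assumption made for this sketch.
    """
    stars = list(stars)
    hub = {user: 1.0 for user, _, _ in stars}
    auth = {repo: 1.0 for _, repo, _ in stars}

    for _ in range(iterations):
        # Authority update: a repository collects the hub scores
        # of the users who starred it.
        new_auth = defaultdict(float)
        for user, repo, weight in stars:
            new_auth[repo] += weight * hub[user]
        norm = sqrt(sum(v * v for v in new_auth.values())) or 1.0
        auth = {repo: v / norm for repo, v in new_auth.items()}

        # Hub update: a user collects the authority scores
        # of the repositories they starred.
        new_hub = defaultdict(float)
        for user, repo, weight in stars:
            new_hub[user] += weight * auth[repo]
        norm = sqrt(sum(v * v for v in new_hub.values())) or 1.0
        hub = {user: v / norm for user, v in new_hub.items()}

    return hub, auth


# Toy data (hypothetical): repoA has two stars, repoB has one.
unweighted = [
    ("alice", "repoA", 1.0),
    ("bob", "repoA", 1.0),
    ("bob", "repoB", 1.0),
]
# Same stars, but one edge carries a larger, made-up weight.
weighted = [
    ("alice", "repoA", 1.0),
    ("bob", "repoA", 1.0),
    ("bob", "repoB", 5.0),
]

_, auth_unweighted = hits_star_graph(unweighted)
_, auth_weighted = hits_star_graph(weighted)

ranking_unweighted = sorted(auth_unweighted, key=auth_unweighted.get, reverse=True)
ranking_weighted = sorted(auth_weighted, key=auth_weighted.get, reverse=True)
print(ranking_unweighted)
print(ranking_weighted)
```

On this toy graph the two runs rank the repositories differently, which mirrors the abstract's observation that weighted and standard HITS produce different top lists. A per-month ranking could be sketched the same way by passing only the stars given in that month, assuming each star carries a timestamp.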

Similar resources

On GitHub's Programming Languages

GitHub is the most widely used social, distributed version control system. It has around 10 million registered users and hosts over 16 million public repositories. Its user base is also very active, as GitHub ranks among the top 100 most popular websites on Alexa. In this study, we collect GitHub’s state in its entirety. Doing so allows us to study new aspects of the ecosystem. Although GitHub is the...

GitHub and Stack Overflow: Analyzing Developer Interests Across Multiple Social Collaborative Platforms

Increasingly, software developers are using a wide array of social collaborative platforms for software development and learning. In this work, we examined the similarities in developer’s interests within and across GitHub and Stack Overflow. Our study finds that developers share common interests in GitHub and Stack Overflow; on average, 39% of the GitHub repositories and Stack Overflow questio...

The Promises and Perils of Mining GitHub (Extended Version)

With over 10 million git repositories, GitHub is becoming one of the most important sources of software artifacts on the Internet. Researchers mine the information stored in GitHub’s event logs to understand how its users employ the site to collaborate on software, but so far there have been no studies describing the quality and properties of the available GitHub data. We document the results o...

OASIS Open Repositories: Overview

As documented in the resources cited below, any qualifying OASIS Technical Committee (a TC using the Non-Assertion [1], RF on Limited Terms [2], or RF on RAND [3] IPR Mode) may request the creation of one or more Open Repositories to enable development of additional material — to supplement or support its standards work. Open Repositories are set up as GitHub projects under the GitHub organizat...

Applying Link Prediction for Repository Recommendation on GitHub

GitHub is one of the world’s most popular platforms for open source software development. As different developers have different expertise and interests, given information about the repositories to which a user has contributed, it may be useful to suggest “similar” repositories that the user may wish to contribute to. For our project, we attempt to solve this problem by performing link predicti...

Journal title:

Volume 5, Issue

Pages -

Publication date: 2016